Ending | Count |
---|---|
؟ | 156 |
الماضي. | 45 |
العراق. | 28 |
أخرى. | 28 |
ذلك. | 27 |
فقط. | 26 |
البلاد. | 26 |
العالم. | 26 |
الله. | 21 |
المتحدة. | 21 |
فيها. | 21 |
اليوم. | 21 |
منها. | 20 |
عليها. | 19 |
سنوات. | 18 |
المقبل. | 18 |
العربية. | 18 |
المنطقة. | 17 |
عليه. | 17 |
لها. | 16 |
غزة. | 16 |
اليمن. | 15 |
الدولة. | 15 |
دولار. | 15 |
العربي. | 15 |
له. | 14 |
سوريا. | 14 |
المدينة. | 14 |
العام. | 14 |
الفلسطينية. | 13 |
In the next four subsections show the most frequent sentence endings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the end of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', -1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.1 Most Frequent Sentence Beginnings I
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV